131 research outputs found
Likelihood-based Out-of-Distribution Detection with Denoising Diffusion Probabilistic Models
Out-of-Distribution detection between dataset pairs has been extensively
explored with generative models. We show that likelihood-based
Out-of-Distribution detection can be extended to diffusion models by leveraging
the fact that they, like other likelihood-based generative models, are
dramatically affected by the input sample complexity. Currently, all
Out-of-Distribution detection methods with Diffusion Models are
reconstruction-based. We propose a new likelihood ratio for Out-of-Distribution
detection with Deep Denoising Diffusion Models, which we call the Complexity
Corrected Likelihood Ratio. Our likelihood ratio is constructed using Evidence
Lower-Bound evaluations from an individual model at various noising levels. We
present results that are comparable to state-of-the-art Out-of-Distribution
detection methods with generative models.Comment: 9 pages (main paper), 3 pages (acknowledgements & references), 3
figures, 2 tables, 1 algorithm, work accepted for BMVC 202
Hierarchical Subquery Evaluation for Active Learning on a Graph
To train good supervised and semi-supervised object classifiers, it is
critical that we not waste the time of the human experts who are providing the
training labels. Existing active learning strategies can have uneven
performance, being efficient on some datasets but wasteful on others, or
inconsistent just between runs on the same dataset. We propose perplexity based
graph construction and a new hierarchical subquery evaluation algorithm to
combat this variability, and to release the potential of Expected Error
Reduction.
Under some specific circumstances, Expected Error Reduction has been one of
the strongest-performing informativeness criteria for active learning. Until
now, it has also been prohibitively costly to compute for sizeable datasets. We
demonstrate our highly practical algorithm, comparing it to other active
learning measures on classification datasets that vary in sparsity,
dimensionality, and size. Our algorithm is consistent over multiple runs and
achieves high accuracy, while querying the human expert for labels at a
frequency that matches their desired time budget.Comment: CVPR 201
The GAN that Warped: Semantic Attribute Editing with Unpaired Data
Deep neural networks have recently been used to edit images with great
success, in particular for faces. However, they are often limited to only being
able to work at a restricted range of resolutions. Many methods are so flexible
that face edits can often result in an unwanted loss of identity. This work
proposes to learn how to perform semantic image edits through the application
of smooth warp fields. Previous approaches that attempted to use warping for
semantic edits required paired data, i.e. example images of the same subject
with different semantic attributes. In contrast, we employ recent advances in
Generative Adversarial Networks that allow our model to be trained with
unpaired data. We demonstrate face editing at very high resolutions (4k images)
with a single forward pass of a deep network at a lower resolution. We also
show that our edits are substantially better at preserving the subject's
identity
The GAN that warped: semantic attribute editing with unpaired data
Deep neural networks have recently been used to edit images with great success, in particular for faces. However, they are often limited to only being able to work at a restricted range of resolutions. Many methods are so flexible that face edits can often result in an unwanted loss of identity. This work proposes to learn how to perform semantic image edits through the application of smooth warp fields. Previous approaches that attempted to use warping for semantic edits required paired data, i.e. example images of the same subject with different semantic attributes. In contrast, we employ recent advances in Generative Adversarial Networks that allow our model to be trained with unpaired data. We demonstrate face editing at very high resolutions (4k images) with a single forward pass of a deep network at a lower resolution. We also show that our edits are substantially better at preserving the subject's identity. The robustness of our approach is demonstrated by showing plausible image editing results on the Cub200 birds dataset. To our knowledge this has not been previously accomplished, due the challenging nature of the dataset
- …